
************************************************************************
Replication Package for
"Learning from Unincentivized and Incentivized Communication: A Randomized Controlled Trial in India"
by Yonas Alem and Eugenie Dugoua
************************************************************************

Main raw datasets (provided by the local agency):
./data_seed.csv: contains responses by the seeds 
./data_communication.csv: contains responses by peers in the communication group
./data_control.csv: contains responses by peers in the control group
./data_network.csv: contains responses by peers in the network group

Additional raw dataset (downloaded from Google trends):
./2014-today_solartermsIndiaweekly.csv: contains the weekly frequency of terms such as "solar lantern" in Google searches in India.
./comparedtokerosene.csv: contains the weekly frequency of terms such as "solar lantern" and "kerosene" in Google searches in India.


TO REPLICATE THE PAPER:

Step 1: Data Cleaning
The Stata script cleaning.do takes as inputs the raw datasets above and outputs the clean datasets: sample_data.dta and seed_data.dta

Step 2: Data Analysis 
The Stata script analysis.do outputs the tables included in the paper. 
The Stata script analysis_som.do outputs the tables included in the Online appendix. 
The Python script graphs.py outputs the graphs included in the paper and online appendix. 

NB: Before running the analysis scripts, two folders ("Tables" and "Graphs") must be created under the directory. The scripts will save tables and graphs under these two folders. 









